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Empirical-likelihood-based confidence intervals for a mean were 
introduced by Owen [Biometrika 75 (1988) 237-249], where at least 
a finite second moment is required. This excludes some important 
distributions, for example, those in the domain of attraction of a 
^H , stable law with index between 1 and 2. In this article we use a method 

^ . similar to Qin and Wong [Scand. J. Statist. 23 (1996) 209-219] to 

derive an empirical-likelihood-based confidence interval for the mean 
when the underlying distribution has heavy tails. Our method can 
C^ ' easily be extended to obtain a confidence interval for any order of 

moment of a heavy-tailed distribution. 



in 
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1. Introduction. Suppose Xi, . . . ,Xn are i.i.d. random variables with 
^ ' common distribution function F satisfying 

S: l-F(x)=x-"«Li(x), 

in' (1-1) 

\0] F{-x)=x-^^L2{x), 

^ I where or > 1, ol > 1 and Li{x) and L2{x) are slowly varying functions, 

O ' that is, limt^oo Li(tx)/Li{t) = 1 and limt^oo L2{tx) / L2(t) = 1 for all x > 0. 

^H ! We are interested in obtaining a confidence interval for the mean fj, = E{Xi). 

In the case an = aL = Oi& {1,2) aiid\\uix^oo{Li{x))/{Li{x) + L2{x)) =p& 
[0, 1], we know that F lies in the domain of attraction of a stable law with 
index a. If both q/j and a^ are greater than 2, then F is in the domain 
^ ■ of attraction of a normal distribution. The use of heavy-tailed models in 

financial markets, such as value- at-risk in the context of risk management, 
;^ ■ is attracting more attention; see the book by Embrechts, Kliippelberg and 

Mikosch (1997). There are many other applications of heavy-tailed distribu- 
tions, for example, teletraffic data [see Resnick (1997)] and community size 
[see Feuerverger and Hall (1999)]. 
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2 L. PENG 

To estimate the population mean /x, a natural estimator is the sample 
mean X^ = ^ J27=i ^i- Since the type of limiting distribution of the sample 
mean, a stable law or a normal, depends on the tail indices ai and an, it 
is hard to derive a confidence interval for /i based on the limit, especially 
when the limit is a stable law with index between 1 and 2. To avoid distin- 
guishing the type of limit. Hall and LePage (1996) proposed a subsample 
bootstrap method to construct a confidence interval for // under more gen- 
eral conditions than (1.1). However, this method performed poorly in the 
case aL = an G (1,2) [see Hall and Jing (1998)]. More recently, Peng (2001) 
proposed a new estimator for fx that has a limit that is always a normal 
distribution by assuming aji = ol > 1 and second order regular variation 
conditions for 1 — F{x) and F{—x). This result prompted us to look at the 
empirical likelihood method. Empirical likelihood methods for constructing 
confidence regions were introduced by Owen (1988, 1990). One of their ad- 
vantages is that they enable the shape of a region, such as the degree of 
asymmetry in case of a confidence interval, to be determined automatically 
by the sample. In certain regular cases, empirical-likelihood-based confidence 
regions are Bartlett correctable; see Hall and La Scala (1990), DiCiccio, Hall 
and Romano (1991) and Chen and Hall (1993). For more details on empirical 
likelihood methods, see Owen (2001). 

We organize this article as follows. In Section 2 we introduce our empirical 
likelihood method and state our main results. A simulation study and a real 
application are given in Section 3. All proofs are deferred to Section 4. 

2. Methodology and main results. The key idea of our method is to 
employ extreme value theory to deal with two tails of the underlying distri- 
bution and to treat the middle part of the underlying distribution nonpara- 
metrically. Since this approach involves estimation of tail index, we know we 
need a stricter condition than (1.1) to investigate the asymptotic behavior 
of the tail index estimator. For simplicity we assume, as j; — > oo, 

1 - F{x) = CRX^"«{1 + bRX-^'' + o(x~^«)}, 
(2.1) 

F{-x) = CLX~°"^{1 + 6lx-^^ + o(x-^^)}, 

where cr > 0, cl > 0, bfi / 0, 6^ / 0, /3ij > and Pl > 0. We remark that 
(2.1) is a special case of a general second-order regular variation [see de 
Haan and Stadtmiiller (1996)]. 

Put 5i = I{Xi < Xn,m) and 5i = I{Xi > Xn,n-k+i), where X„,i < • • • < Xn,n 
denote the order statistics of Xi, . . . ,Xn, m = m{n) — > oo, m/n — > 0, A: = 
k{n) — > oo, k/n — > 0, Xn,m < and Xn^n-k+i > 0. We may approximate the 
two tails of F by 

F{x) = l-CRX~'^^, \i X > Xn,n-k+l, 
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and 

Thus the log-hkehhood function may be written as two parts, the parametric 
log-hkehhood based on data in two tails and the nonparametric hkehhood 
based on data in the midrange, that is, 



where 



lo{cR,CL,aR,aL,Pi) = li{cR,CL,aR,aL) + l2{Pi), 



h{cR,CL,aR,aL) 

= klogaR + klogCR- (oR + l) ^ logXi 

+ mlogaL + m log c^ - (a^ + 1) ^ log{-Xi 

<5,; = 1 



and 



By considering the log- likelihood function lo{cR,CL,aR,aL,Pi) subject to 
the constraints 

(2.2) aR>l, aL>l, cr>0, cl > 0, Pi>0, 

(2.3) Yl Pi = ^-CRX-^^,^,-cU-Xn,r 
Si=Si=0 



-OIL 
.,mj ) 



(2.4) Y: P«^^=/^-C«-^<n"4+l-CL-^(-^n,^)'~"S 

e e r^ ^R ^ ^L ^ 

we can employ the semiparametric likelihood ratio method, as in Qin and 
Wong (1996), to obtain a confidence interval for //. The details are given in 
the following discussion. 

First we maximize /q subject to constraints (2.2) and (2.3). By the method 
of Lagrange multipliers, we easily obtain 

-1 



(2.5) 



aR = 




aL = 




CR = 
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r,-!I!:(-x r^ 

n 

Pi = n~^, [i5i = 5i=0. 

Note that both or and ai are Hill estimators [see Hill (1975)]. 

Next we maximize Iq subject to constraints (2.2)-(2.4) for fixed fj,. Define 

Wi = Wi{fi,CR,CL,aR,aL) 

= Xi-{fi- CRaniaR - l)"^X^;;,"^+i 



1 - CiJ^„,„f!fc+i - CL{-Xn,my 



OIL' 



Cij(aij,/i) = -X^«_^+^ 



Elog 



X,, 



k 



^Si=l 



Xn,n-k+l CUR 



x{aR{aR-l)n ^X^l_^^^ 
-^^{aR-lfn-^X:^ZU,], 

TTl 



n 






Xi m 1 

\,m CHI, J 

x{aL{aL-l)n-'i-Xn,mr'^ 



X2{aR,fJ-) = {kn ^(OR-I) '^Xn,n-k+l 






Xi 



Xn,n-k+l CX.R 



n ^aij(aR-l) ^Xn,n-k+i + n V 



A2(aL,^) = <mn ^(a^ - 1) ^(-X„,m) 









-1 



-?z ^aL(aL-l) ^(-X„,m) + n V 



Ai(aR, aL,/i) = {n-m-k) 
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X {1 -c,j(aR,/.)X-°«,^, -Ci(aL,/i)(-X„„„)~"^}-i. 
Let aji = a:ji{n) and ol = O-L^ij) denote the solutions to the equations 

= X! {wi{fJ',CR{aR,n),CL{aL,fJ.),aR,aL)) 
Si=Si=0 

(2.6) X (Ai(aij,aL,/i) 

+ A2(aij, fJ.)wi{iJ,, CR{aR, n),CL{aL,lJ.),aR, ol))^ 
= 
and 

h{aR,aL,fJ.) 

= X! {'Wi{f^jCR{aR,^i),CL{aL,fJ.),aR,aL)) 

(2.7) X (Ai(aR,aL,/u) 

+ X2{aL,fJ.)'Wi{^,CR{aR,n),CL{aL,fJ-),aR,aL))~ 
= 0. 
(The existence is given in Lemma 3 in Section 4.) Hence 
CR = CR{aR,fj.), 

(2.8) Ai = Ai(aR,aL,^), 
A2 = A2(aR,/i), 

Pi = {Ai + X2Wi{^i,CR,CL,aR,aL)}~^, ii6i = 6i = 

are the values which maximize Iq subject to (2.2)-(2.4). (The proof is given 
in Section 4.) Therefore, the semiparametric likelihood ratio multiplied by —2 
is defined as 

l{fx) = -2{lQ{cR,CL,aR,aL,Pi) - loicR,CL,aR,CL,Pi)}. 
Our main results are as follows. 

Theorem 1. Suppose (2.1) holds. Choose k = o(n2/3fl/("H+2/3fl))^ A;/logn - 
oo, m = o{v?^^'^'^'^~^'^l^^>) and m/logn — > oo as n ^ oo. Further assume 
Bqv}° <m,k< Biv}^ for some -Bq > 0, i?i > 0, < /q < ^i < 1 and 

lim 

n^°° (fc/n)^-2/"fl + (m/n)i^2/QL 
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exists. Let the true mean he hq. Then 

Based on Theorem 1, a simple approach to construct a 1 — a level confi- 
dence interval for fiQ is 

where di-a is the 1 — a quantile of a xfu distribution. This gives a confidence 
interval for fiQ with asymptotically correct coverage probability 1 — a, as 
stated in the following corollary. 

Corollary 1. Assume the conditions in Theorem 1 hold. Then as n^> 

oo, 

P{fioeli^a) = l-a + o{l). 

Remark 1. Our method can easily be extended to obtain a confidence 
interval for any order of moment of a heavy-tailed distribution. 

Remark 2. The choices of sample fractions k and m are difficult both 
theoretically and practically. This will be a part of our future research plan. 
However, our simulation study in Section 3 shows that this approach is 
robust against the choices of sample fraction. 

Remark 3. An empirical likelihood estimator for fi can be obtained 
as /z = argmin^/(/^), which can be shown to give fi — fi = Op{ ^^"^'^ ), 
where o"(s,t) is defined in Lemma 1. That is, the convergence rate of ft is 
the same as that of the estimator proposed by Peng (2001). Indeed, fi has 
the same asymptotic behavior as the estimator in Peng (2001) since no side 
information is involved in our empirical likelihood method. 

3. A simulation study and a real application. For simplicity, our simula- 
tion study and real application only deal with the right tail by assuming that 
the left endpoint of the underlying distribution is finite. Hence we replace 
CL(aL,/u), cl and Xn,m by zero and remove (2.7) in our empirical likelihood 
method described in Section 2. 

3.1. A simulation study. In this section we conduct a simulation study 
to investigate the coverage accuracy of our proposed empirical-likelihood- 
based confidence interval for a mean, and then we compare it with the 
normal approximation method and the subsample bootstrap method which 
are given below. 



Define 
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1 " 



Xn — — y Xi, 

n ^ 
1=1 

Tn = \/n{Xn - fl)/Sn- 

Without verifying the finiteness of variance, a confidence interval with nom- 
inal level 1 — a based on normal approximation can be obtained as 

I^-a = i^n - Za/2Sn/Vn,Xn + Z^/2Sn/Vn), 

where Za/2 satisfies P(|A^(0, 1)| > -Zq/2) = «• The subsample bootstrap method 
was proposed by Hall and LePage (1996) as follows. Conditional on Xi, . . . , Xn, 
let XI, . . . , X*^ denote independent and identically distributed random vari- 
ables drawn randomly, with replacement, from Xi, . . . ,Xn- Put 



1 



ni 



X* 


= -yx:, 

"1 ^=i 




\ 1=1 


T* 


^{Xl^-Xn) 



1/2 



and 

xi_Q,=sup{2;:P(|T^*J < x|Xi, . . . ,X„) < 1 - a}. 
Then a nominal 1 — a level confidence interval for ^ is 

Il-a = {Xn - Xl-aSn/Vn,Xn + Xi-aSn/Vn), 



which has asymptotic coverage probability 1 — q under very mild regularity 
conditions, including ni ^ oo and rii/n — > as n — > cxd [see Hall and LePage 
(1996) for details]. 

We generated 500 pseudorandom samples of size n = 1000 from one of 
the two distributions: (1) a Burr(aR,/3/j + aR) distribution, given by F{x) = 
1 - (1 + x^«)~°'«/^«, x>0; (2) a Prechet(aR) distribution, given by F{x) = 
exp{— x~"^}, X > 0. For the subsample bootstrap method we drew 1000 
resamples each time. 
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First we compare our empirical likelihood method with the normal ap- 
proximation method and the subsample bootstrap method in terms of cov- 
erage probability by employing a practical choice of fc = [k* /\og{k*)] = rii 
in view of Theorem 1, where 

(3.1) 

minimizes the mean squared error of the Hill estimator [see Hall and Welsh 
(1985) or de Haan and Peng (1998)]. Here we use the theoretical value of k* 
rather than the estimated value, since we investigate the effect of the choice 
of sample fraction in our next comparison. Coverage probabilities based on 
these three methods are reported in Table 1. We can conclude from Table 1 
that our empirical likelihood method is better than the other two methods 
when the index an is near 2. 

Second, we compare our empirical likelihood method with the normal 
approximation method and the subsample bootstrap method in terms of 
coverage probability by employing different choices of sample fraction for 
distributions Frechet(2.0) and Burr(2. 0,4.0); see Tables 2 and 3. These two 
tables show that the advantage of our empirical likelihood method is robust 
against the choice of sample fraction. 

Third, we investigate the lengths of confidence intervals based on these 
three methods by employing the practical choice oi k = [fc*/log(A;*)] = ni 

Table 1 
Coverage probabilities by employing a practical choice of sample fraction 

Io.90 logo lo.go Jo. 95 -fo.95 -fo.95 fe = ni 



Frechet(1.5) 


0.894 


0.888 


0.728 


0.944 


0.950 


0.782 


37 


Frechet(1.8) 


0.900 


0.926 


0.830 


0.952 


0.970 


0.868 


37 


Frechet(2.0) 


0.910 


0.936 


0.850 


0.954 


0.980 


0.902 


37 


Frechet(2.2) 


0.910 


0.940 


0.874 


0.956 


0.978 


0.920 


37 


Frechet(3.0) 


0.908 


0.940 


0.902 


0.968 


0.984 


0.932 


37 


Frechet(5.0) 


0.914 


0.922 


0.896 


0.968 


0.980 


0.946 


37 


Burr(1.5,3.0) 


0.886 


0.906 


0.728 


0.946 


0.962 


0.784 


25 


Burr(1.8,3.6) 


0.894 


0.938 


0.834 


0.954 


0.980 


0.868 


25 


Burr(2.0,4.0) 


0.896 


0.946 


0.854 


0.952 


0.982 


0.900 


25 


Burr(2.2,4.4) 


0.910 


0.950 


0.874 


0.956 


0.984 


0.924 


25 


Burr(3.0,6.0) 


0.906 


0.944 


0.900 


0.960 


0.988 


0.936 


25 


Burr(5.0, 10.0) 


0.910 


0.936 


0.904 


0.966 


0.976 


0.942 


25 



Note: We report the coverage probabilities for confidence intervals 
based on our empirical likelihood method, the normal approximation 
method and the subsample bootstrap method with confidence levels 
0.90 and 0.95. We choose k = [fc*/log(fc*)] = ni, where k* is defined 
in (3.1). 
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Table 2 

Coverage probabilities by employing different choices of sample 

fraction for Frechet(2.0) 



fc = ni 


logo 


Jo. 90 


-'0.90 


Jo. 95 


Jo. 95 


Jo. 95 


20 


0.900 


0.956 


0.850 


0.950 


0.988 


0.902 


22 


0.900 


0.956 


0.850 


0.946 


0.990 


0.902 


24 


0.898 


0.950 


0.850 


0.958 


0.986 


0.902 


26 


0.892 


0.946 


0.850 


0.954 


0.988 


0.902 


28 


0.898 


0.940 


0.850 


0.956 


0.982 


0.902 


30 


0.904 


0.936 


0.850 


0.956 


0.980 


0.902 


32 


0.898 


0.936 


0.850 


0.958 


0.980 


0.902 


34 


0.918 


0.936 


0.850 


0.952 


0.980 


0.902 


36 


0.908 


0.936 


0.850 


0.954 


0.980 


0.902 


38 


0.908 


0.934 


0.850 


0.958 


0.974 


0.902 


40 


0.912 


0.932 


0.850 


0.958 


0.976 


0.902 


42 


0.906 


0.934 


0.850 


0.954 


0.972 


0.902 


44 


0.906 


0.930 


0.850 


0.958 


0.968 


0.902 


46 


0.900 


0.930 


0.850 


0.960 


0.968 


0.902 


48 


0.898 


0.930 


0.850 


0.962 


0.966 


0.902 


50 


0.900 


0.926 


0.850 


0.962 


0.966 


0.902 


52 


0.902 


0.928 


0.850 


0.958 


0.964 


0.902 


54 


0.904 


0.926 


0.850 


0.958 


0.964 


0.902 


56 


0.906 


0.926 


0.850 


0.958 


0.962 


0.902 


58 


0.906 


0.926 


0.850 


0.960 


0.960 


0.902 


60 


0.908 


0.922 


0.850 


0.954 


0.960 


0.902 



Note: We report the coverage probabilities for confidence in- 
tervals based on our empirical likelihood method, the normal 
approximation method and the subsample bootstrap method 
with confidence levels 0.90 and 0.95. The underlying distri- 
bution is taken as Frechet(2.0). 



given above. We took confidence level 90% and considered distributions 
Frecliet(1.8), Frechet(2.0), Frechet(2.2), Burr(1.8,3.6), Burr(2.0,4.0) and Burr(2.2,4.4). 
Since we found that our empirical likelihood method gave a very large value 
for the right endpoint from time to time, we report in Table 4 the median 
of left and right endpoints of the confidence intervals based on these three 
methods. Note that the distributions considered above are skewed to the 
right and our empirical likelihood method clearly shows this property, that 
is, with a larger median of endpoints. This also explains why our empirical 
likelihood method is better than the other two methods in terms of coverage 
probability when the tail index is near 2. Moreover, our box plots for the 
endpoints, which are not presented here, show that the symmetric intervals 
based on the subsample bootstrap method are not good since there are some 
left endpoints have negative values when or < 2. 
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Table 3 

Coverage probabilities by employing different choices of sample 

fraction for Burr(2. 0,4.0) 



fc = ni 


-fo.90 


J'^o.go 


jN 

^0.90 


Jo. 95 


Jo. 95 


-•O.OB 


20 


0.904 


0.948 


0.854 


0.952 


0.986 


0.900 


22 


0.902 


0.948 


0.854 


0.946 


0.984 


0.900 


24 


0.896 


0.948 


0.854 


0.954 


0.986 


0.900 


26 


0.894 


0.946 


0.854 


0.952 


0.982 


0.900 


28 


0.900 


0.938 


0.854 


0.954 


0.978 


0.900 


30 


0.906 


0.938 


0.854 


0.956 


0.976 


0.900 


32 


0.896 


0.936 


0.854 


0.956 


0.978 


0.900 


34 


0.916 


0.936 


0.854 


0.952 


0.976 


0.900 


36 


0.908 


0.932 


0.854 


0.956 


0.978 


0.900 


38 


0.910 


0.932 


0.854 


0.958 


0.974 


0.900 


40 


0.916 


0.932 


0.854 


0.958 


0.970 


0.900 


42 


0.908 


0.934 


0.854 


0.952 


0.970 


0.900 


44 


0.908 


0.932 


0.854 


0.960 


0.970 


0.900 


46 


0.900 


0.930 


0.854 


0.958 


0.968 


0.900 


48 


0.898 


0.926 


0.854 


0.962 


0.964 


0.900 


50 


0.904 


0.930 


0.854 


0.960 


0.966 


0.900 


52 


0.900 


0.930 


0.854 


0.958 


0.964 


0.900 


54 


0.900 


0.926 


0.854 


0.958 


0.962 


0.900 


56 


0.906 


0.922 


0.854 


0.960 


0.962 


0.900 


58 


0.906 


0.922 


0.854 


0.958 


0.960 


0.900 


60 


0.912 


0.922 


0.854 


0.956 


0.962 


0.900 



Note: We report the coverage probabilities for confidence in- 
tervals based on our empirical likelihood method, the normal 
approximation method and the subsample bootstrap method 
with confidence levels 0.90 and 0.95. The underlying distribu- 
tion is taken as Burr(2. 0,4.0). 

3.2. A real application. The data set we analyzed consists of 2156 Dan- 
ish fire losses over one million Danish krone from the years 1980 to 1990 
inclusive (see Figure 1). The loss figure is a total loss figure for the event 
concerned and includes damage to buildings, furnishings and personal prop- 
erty as well as loss of profits. This Danish fire data set was analyzed by 
McNeil (1997) and Resnick (1996), where the right tail index was confirmed 
to be between 1 and 2. Here we try to find a confidence interval for the 
mean value of the Danish fire loss. In Figure 2 we plot the endpoints of the 
confidence intervals based on the normal approximation method, our empir- 
ical likelihood method and the subsample bootstrap method with various 
choices of sample fraction k and subsample size ni. We observe that the 
empirical-likelihood-based confidence interval has larger values than the in- 
terval constructed from the subsample bootstrap method in most cases. The 
interval based on normal approximation has a significantly smaller length 
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Table 4 
Median of endpoints of confidence intervals with confidence level 90% 





MoLE- 


MoLE- 


MoLE- 


MoRE 


MoRE- 


MoRE 




SBM 


ELM 


NAM 


SBM 


ELM 


NAM 


Frechet(1.8) 


1.635 


1.853 


1.782 


2.275 


2.428 


2.137 


Frechet(2.0) 


1.547 


1.683 


1.633 


1.961 


2.013 


1.884 


Frechet(2.2) 


1.472 


1.559 


1.527 


1.765 


1.789 


1.717 


Burr(1.8,3.6) 


1.394 


1.632 


1.559 


2.080 


2.242 


1.920 


Burr(2.0,4.0) 


1.333 


1.471 


1.428 


1.770 


1.816 


1.686 


Burr(2.2,4.4) 


1.278 


1.373 


1.336 


1.586 


1.603 


1.533 



Note: We report the median of both left and right endpoints of the confidence intervals 
based on the subsample bootstrap method (SBM), the empirical likelihood method (ELM) 
and the normal approximation method (NAM) with confidence level 90%. We use MoLE- 
SBM and MoRE-SBM to denote the median of left and right endpoints of the confidence 
intervals based on the subsample bootstrap method, respectively; similarly for notation 
MoLE-ELM, MoRE-ELM, MoLE-NAM and MoRE-NAM. 



than the other two types of intervals due to infinite variance. Moreover, our 
empirical likelihood method clearly demonstrates the skewness of the data 
to the right and gives asymmetric confidence intervals. As we mentioned 
in Remark 2, the choice of sample fraction is difficult and very important. 
We hope to be able to carry on our research on this issue in the future. 

4. Proofs. 

Proof of (2.8). Put 

91 =gi{cR,CL,aR,aL) = 1 - CrX'^'^j^j^-^ - CL{-Xn,mr"''' , 

92 = g2{fJ',CR,CL,aR,aL) 

-a Cf. "^ .Y^-°« c; "^ { X )i-"^ 

H{n,CR,CL,aR,aL,XiA2,Pi) 

= klogUR + klogCR - {or + 1) X! log^« 

s,=i 

+ mlogaL + mlogCL - {ai + 1) X] l°g^* 



+ Yl logpi-AJ X Pi-gi\->^2 Yl P' 

5i=Si=0 U,=5,=0 J 5i=S,=0 



Wi. 
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Danish Data 
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200 400 600 800 1000 1200 1400 1600 1800 2000 

Fig. 1. Danish fire loss data, which consists of 2156 losses over one million Danish krone 
for the years 1980 to 1990 inclusive. 



By the method of Lagrange multiphers we have 
k 

CR 






(4.1) 



(4.2) 



A. 



J ct-R yi-aR ^-i„ y-'^R I— n 

:i^ _'^^n,n-k+l~ 9l fi'2A„ „_fc_^i I - U, 



m 



Al(— A„rn,) 

CL 



I ai - 1 ) 






OR 



Si=l 



(4.3) +Xj 7-^X^-"4+l + ^^K7-Ul l°g ^n,n-k+l 

I [oiR - i-r oiR- I 

- 9l^ 92CRX~'^'tk+^ log Xn,n^k+l | = 0, 
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Fig. 2. Confidence intervals for the mean value of Danish fire loss data. The end- 
points of confidence intervals with level 95% based on our empirical likelihood method, 
the normal approximation method and the subsample bootstrap method are plotted against 
k = ni = 20, 30, 40, . . . , 210. Here NAM, ELM and SBM denote the normal approximation 
method, our empirical likelihood method and the subsample bootstrap method, respectively. 



(4.4) 



777 



OIL 



Si=l 



CL 



iaL-iy' '''"'' 

+ -^^{-Xn,mf-"^ log(-X„,„) 

- 5r'52CL(-Xn,m)-"^ log(-X„,„)| = 0, 



(4.5) 

(4.6) 
(4.7) 



Pi = {Ai + A2Wi} \ if(5i = 5i = 0, 



E 

E 



1 

Ai + \2Wi 
Ai + \2Wi 



91, 
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By (4.1) and (4.3), 
k 



(4.8) — - E l°g -^^^^- + r^^ "'"°^ 



+ (a„_l)2^n,n-fc+l-U- 



By (4.2) and (4.4), 



Xi , X2CL 



("■») :?^-EiosT^ + 7-^^(-^».».)'-"' = o- 



"L f^^ Xn,m {oiL - 1 



By (4.5)-(4.7), 



Ai 



(4.10) 



n — m — k 
91 



= {n-m- k){l - crX;^Z%^, - cl(-X„,^)-"^}-^ 
By (4.1), (4.2) and (4.10), 

(4.11) A25'r^ff2 = A2/X + Ai -n. 

By (4.1) and (4.11), 



(4-12) ^ - ^Xl-:_l^, + \2l^X--_,^, - nX-l 

By (4.2) and (4.11), 
m A2 aL 



Ofl —0 

fc+i ~^- 



(4.13) 

By (4.8) and (4.12), 



(-^n,m)'-"^ +A2;u(-X„,„0-"^ -n(-X„,^)— ^ = 0. 



X2 = \kn ^{aR-1) '^Xn,n-k+i 



El°g 



k 



(4.14) 



&^ = l 



Xn,n~k+l CtR 



-n ^aR{aR-l) ^Xn,n-k+i + n V 



and 



(4.15) 



CR = 


" ^^ri,n-k+l 






-{tmy^ — 

/ Xn,n-k+l 


k 

OiR 



2^-1 v"fl-l 



X {aRiaR - l)n-'X:-_,^, - f,{aR - l)^n-^X„X.+i}- 
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By (4.9) and (4.13), 



(4.16) 



and 



v^ , Xi m 

2^ log — 

5,=1 '^'™ "^. 



n ^aL(QL-l) ^{-Xn,m)+n Vf 



Cr-—(-X 1"^ 

n 

^{aL{aL - l)n-\-Xn,mT^ - fi{aL - l)2n-i(-X„,„)"^-i}. 

So (2.6) follows from (4.7), (4.10), (4.14), (4.15) and (4.17). Similarly (2.7) 
follows from (4.7), (4.10) and (4.15)-(4.17). Hence (2.8). D 

Before we prove Theorem 1, we need some notation and several lemmas. 
Let F~ denote the inverse function of F. Then (2.1) implies that 

F-{l-x) = cV"«x-i/"«{l + dRxf^ + o{xf^)}, 
(4.18) 

F-{x) = -c}^/"^x-^/"^{l + dLxP^ + o{xP^)} 
as X ^ 0, where pR = (in/aR, cLr = bRC^^ /aR, pL = (iL/o-L and di = 

Let Ui,U2,--- be i.i.d. random variables with a uniform distribution on 
(0,1) and let Un,i < ••• < Un,n denote the order statistics of C/i,...,C/„. 
Define Gn{u) = \Y^^=iI{Ui ^ ^)- Then from Csorgo, Csorgo, Horvath and 
Mason (1986) there exists a sequence of Brownian bridges Bn{u), < n < 1, 
n = 1, 2, . . . , such that 

(4.19) sup n>- l'^'f"W-';i; ^"'"'1 =0,(1) 

l/n<u<l~l/n (u(1-m))V^ l^ 

as n -^ 00, where p is any fixed number such that p E [0, 1/4). Without loss 
of generality we assume Xnj = F~{Unj), j = l,2,...,n. We use po, crq, 
clO) ctRO and olo to denote the true parameters under model (1.1). 

Lemma 1. Put 

rl-t rl-t 

cj2(s,t)= / / {uAv-uv)dF-{u)dF-{v). 
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Suppose (2.1) holds. Then 

( j-l-k/n 



n /m k 
n n 



va/n 



[F-{s)Yds, 






^xl-2/a«/ 1 

^^1-2/Qfl 

n 



ifaR>2,aL > 2, 
«/aiJ > 2, az, < 2, 

+ / [F-{s)]^ds, 

1/2 

ifaR = 2,aL < 2, 



«/ai? <2,aL > 2, 



^x 1-2/qh 



4 + ^^—)+/ [F'is)fds, 

2- Or J Jm/n 

if CUR < 2,aL = 2, 



n 



Cr + 



1 



2-aRj \n 



m 



l~2/aL 



cl + 



1 



2-aLj' 
ifaR<2,aL < 2. 

Moreover, if Bqit}^ <m, k < BirJ-^ for some Bq > 0, Bi > 0, < Iq < h < 1, 
then 

rl—k/n 

/ [F-(s)]2ds/(m/n)^-2/"^ ^0 incase aR = 2,aL<2, 

Jl/2 



1/5 

m/n 



[F"(s)]2(is/(A:/n)^-2/"«^0 incase ai?<2,aL = 2. 



Proof. It is easy to check that 



a 



m k\ m 



n n 



n 



F^ 



m 



n 



m\' 
n J 



F' 



m 



n 



l2 



2—F-i — / F-{s)ds 

n \n J Jm./n 



k 


■ 


f 


^M 


2 


(k\^ 




( 


kW 


+ - 


F 


1- 


-- 


— 


- 


F 


1- 


-- 


n 




v 


n)\ 




\n) 




V 


n)\ 



n \ n J Jm/n 



+ 



1-k/n 
m/n 



l2 



F~(s] 



ds 



l-k/n -|2 

F^{s)ds . 

m/m 



The rest of the proof follows by applying Potter's bounds [see Geluk and de 
Haan (1987)] to (4.18) in a similar way to the proof of Lemma 2.5 in Csorgo, 
Haeusler and Mason (1988). D 
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Lemma 2. Suppose the conditions in Theorem 1 hold. Then the following 
approximations are true: 

Vk{aR - am} 

A; "V nj \l k Ji-k/n 1 -s 



"™\/t-^"( 1~ r ) +'^™a/t / ~ -ds + Op{l), 



fW^ /m\ fW f"^/'' Bjs) , 

aLo\—Bn[ — ]-aLo\— ds + o.p{l), 

\ m \n J y m Jq s 






rr\ CR{am,fJ'o) _ -, 
cm 

n / k 
/c " \ n 



<[^A^-^)-^JU.m- 



X <^ aBo{am - 1) - /^o(a/?o 



I)^c^y"™(^)'^"°}+«P(1), 



m< 1 

I CLO 

771 V n 



' V w. V ?^ / y m Jo s 



X <{ aLo(aLO - 1) - Aio(aLO - l)^c^o^"''" ( 3 ) }> + Op(l), 



n 



A,'°"°-''°'^oir^V'°"\-/' 



! =Op — m ' 

n \\n J 
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Ai = 1 + Op[l), 

n 



n I 1 



a(m/n, k/n) | n - 



J2 X,- F-{s)ds\ 



i-^/" i?„(g)Q;F-(.) ^ ^ 

'm/n a{m/k, k/n) ^ 
n f 1 



^ ■u^j(Mo,CRO,ci,o,aRo,aLo) ^ +Op(l)- 
=A.=n J 



Proof. Using Theorems 2.3 and 2.4 of Csorgo and Mason (1985), we 
have 



(4.20) 



(4.21) 
Similarly 

(4.22) 



\/k\ - log(l - Un,n-k+l) + log \ 

fi.„(l-^)+o,(l). 



m \n J y m Jo s 



(4.23) 



\/m<^ - log Un,m + log — ^ 



''bJ-]+o,{1). 
m \n 



Hence the lemma follows from (4.18)-(4.23) and Lemma 1. □ 

Lemma 3. Suppose the conditions in Theorem 1 hold. Then there ex- 
ist solutions {aR{fio),aL{fJ-o)) to (2.6) and (2.7) such that a/j(/xo) — a_Ro = 
Op{5g) and aL(^o) — olo = Op{6q), where 6q = 0{k~'^ + m~'^) for any q G 

[1/3,1/2). 
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Proof. By Taylor expansion we can easily show that 

CR{aRO ± ^qjfJ-o) = CRO + Op{6q), 
CL{aLO±^q,fJ'0) =CL0 + Op{6q), 

A2(aL0 ± Sq,fJ-o) = n{aLo - 1)^"lo (~^",m)~H±^g + Op(m"^/^)}, 
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h{aRo±6q,aL,^^o) 



2 -t^l-ajjo 



■(«™-i)-xrn-T+i 



- y2 Wi{iJ.o,CRo,CL,aRo,aL) 



n - 

Si=Si=0 

x{±6q + 0p{k-'/^)}, 



{URO - 1) aM^n,n-fc+l 



h{aRo,aLo±5g,^lo) = -{aw - l)-'(-X„„„)i-"^«{±'^<? + Op{m-^/^)}, 
h{aR,±5q,aL,fio) = -(am - l)-'^^7„'!.fVi{±'^<? + Op{k~^^^)}, 



h{aR,aLo±Sq,fJ'o) 



1 



n 



X] Wi{l-i'0,CR,CLo,aR,aLo) 



Si=Si=0 



x{±6q + Op{m-^/^)}. 



("LO - 1) aLo(~^n ^ 



*L0 V ^^n,ml 



Hence the lemma follows from the above expansions and 

F~(?TT-/?l) 



n,n-k+l _ -I _ f) I 

F-{l-k/n) '-""AVk 



D 



Proof of Theorem 1. Let r] = {cR,CL,aR,aL,Pi)- The key idea in the 
proof is to expand lo{fj) around fj and to derive the convergence rates for 
OiR - aR, ai - OiL, CR - CR and cl - cl- 
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Using Lemma 3 we can easily prove that 






fc+i , 

1 



+ {oLm-lfa^{aR-aRQ) + Opy rr j n 



M{aL, lio) = n{-Xn,m) M (aLO-l)M — Y^ log^^^^ «^o 



'^ A.=i ^?i,m 



a, 



+ ("LO - l)^aLo («i ~ «io) + Op 

It follows from (2.6) and (2.7) that 

X ,-. ^ Ai(Qij,,aL,/io)Es,=5,=o^i(/^o,CR,CL,aR,aL) 



(4.24) 



S5,=5,= 


=o^2^(/^o,CR,CL,aR,aL) 


X{l + 0p(l)} 




na — ) — r 
\n nJ 


- ) \ Wi{^Q,CRo,CLo,aRo,c 


X{l+0p(l)}, 





V n re/ n - "^ 



<5i=<5,=0 
X{l+0p(l)}. 



Hence 



r 2^ log T " 

"^ J , ^n,n— fc+l 



"R - afto = -Oijo "^ r X! 1°S ^^ ^ " 



i?0 

/\ ^ ^ ;^ I 1 



(4.25) 



. , 99 9/ ^^ \ 



X - y] w^i(/io,Ci?o,CLo,ai?o,aLo) 
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and 



aL - aio = -alol — y^ log 



"^ 5^=1 ■^■^^^ 



a 



LO 



(4.26) 



* n n 



X - y" Wi(/"o,CRo,CLo,a/?o,aLo) 
n - ^-^ 



+ 0k 



1 



Furthermore 



(4.27) 



and 



CRiaB^fJ-o) CR{ano,fio) _i 



cro 



cro 



(4.^ 



cz,(aL,/io) CL(aLo,/^o) 



OiRoiaRO - l)(afl - a/?o)(l + Op(l)) 



«Lo(«M - l)(aL - aLo)(l + Op(l))- 



CLO CLO 

Expanding /o('?) at f/ by noting that f] is the maximum hkeUhood estimate, 
we have 

{1 d'^l 
■^Tr^{cR,CL,aR, aL,Pi)[aR - or]'^ 
R 



+ -^-^{cR,CL,aR,aL,Pi)[aL - aiY 

, 19^^0 ._ _ _ _ _ w. _ i2 

+ i:-^[cR,CL,aR,aL,Pi)[cR- cr\ 

Id lo ._ _ _ _ _ w. _ i2 

+ 2"^"^(cR'CL,aR,aL,Pi)[cL - cl\ 



n 



2a'^[m/n,k/n) 



-\2 



- y] 'Wi{fio,CR,CL,aR,aL) 



5i=Si=0 
-2< -2~'^ka]^l[aR - aRf - 2~^kall[aL - aif 



Op{l) 



2-^kc^l[cR -CRf- 2-'kcll[cL - CLf 



"ROi 
n 



2a'^(m/n, k/n) 



n2 



- y] f^i(^o,Cijo,CLo,aM,aLo) 
re- ^-^ 



&i=Si=0 



+ Op{l) 
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It follows from Lemma 2 and (4.24)-(4.28) that 

k[aR-aRf = Op{l), 

k[aL -aif = Op{l), 

k[cR-CRf = Op{l), 

k[cL - cif = Op(l) 



and 

'n 






.iV(o,l) 



a{m/n, k/n) 

Ui Uj — u 

Hence the asymptotic limit of /(/io) is X^(l)- O 
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